Speaker-invariant suprasegmental temporal features in normal and disguised speech
نویسندگان
چکیده
منابع مشابه
Speaker-Invariant Features for Automatic Speech Recognition
In this paper, we consider the generation of features for automatic speech recognition (ASR) that are robust to speaker-variations. One of the major causes for the degradation in the performance of ASR systems is due to inter-speaker variations. These variations are commonly modeled by a pure scaling relation between spectra of speakers enunciating the same sound. Therefore, current state-of-th...
متن کاملCharacterization of Temporal and Acoustic Parameters for Speaker Identification in Disguised Speech
With the fast growth of technology and communication, the criminal activities are also increasing rapidly. We can see the use of latest technology at every next door, may be for good or evil cause. And the limitations of the stepby-step system are increasingly felt. Criminals are very much aware with the latest technologies and they are always ready to beat the surveillance system. Criminals ar...
متن کاملExploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals
Existing automatic speaker verification (ASV) systems perform with high accuracy when the speech signal is collected close to the mouth of the speaker (< 1 ft). However, the performance of these systems reduces significantly when speech signals are collected at a distance from the speaker (2-6 ft). The objective of this paper is to address some issues in the processing of speech signals collect...
متن کاملContextual invariant-integration features for improved speaker-independent speech recognition
This work presents a feature-extraction method that is based on the theory of invariant integration. The invariant-integration features are derived from an extended time period, and their computation has a very low complexity. Recognition experiments show a superior performance of the presented feature type compared to cepstral coefficients using a mel filterbank (MFCCs) or a gammatone filterba...
متن کاملIdiosyncratic Intensity Variability in the Speech Signal
presented at Phonetik & Phonologie 10, Konstanz, Germany. http://ling.uni-konstanz.de/pages/conferences/pp10/abstracts/He_pp10.pdf Klatt, D. H. (1980). Software for a cascade/parallel formant synthesizer, Journal of the Acoustical Society of America 67: 971-995. Klatt, D. H. and Klatt, L. C. (1990). Analysis, synthesis, and perception of voice quality variations among female and male talkers, J...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Speech Communication
سال: 2015
ISSN: 0167-6393
DOI: 10.1016/j.specom.2015.10.002